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Abstract 

A new coding and queue management algorithm is proposed for communication networks that employ linear 
network coding. The algorithm has the feature that the encoding process is truly online, as opposed to a block-by- 
block approach. The setup assumes a packet erasure broadcast channel with stochastic arrivals and full feedback, 
but the proposed scheme is potentially applicable to more general lossy networks with link-by-link feedback. The 
algorithm guarantees that the physical queue size at the sender tracks the backlog in degrees of freedom (also called the 
virtual queue size). The new notion of a node "seeing" a packet is introduced. In terms of this idea, our algorithm may 
be viewed as a natural extension of ARQ schemes to coded networks. Our approach, known as the drop-when-seen 
algorithm, is compared with a baseline queuing approach called drop-when-decoded. It is shown that the expected 
queue size for our approach is O (jz^) as opposed to Q ( J f° r the baseline approach, where p is the load 
factor. 

I. Introduction 

Digital fountain codes ([1], [2]) are a well-known solution to the problem of reliable communication over a 
packet erasure channel. They have low complexity and require no feedback, except to signal successful decoding of 
a block. However, fountain codes do not extend readily to a network setting. Consider a two-link tandem network. 
If the middle node applies a fountain code on coded packets it has received so far, this does not mean that the 
overall code from the sender to the receiver will have the properties of a fountain code. In this sense, the fountain 
code approach is not composable across links. A decode and re-encode scheme will be sub-optimal in terms of 
delay as pointed out by [3]. 

In comparison, the random linear network coding solution [4], is composable because it removes the need for 
decoding at intermediate nodes. This solution ensures that with high probability, the transmitted packet will have the 
innovation guarantee property, i.e., it will bring new information to every receiver, except in the case when the 
receiver already knows as much as the sender. Thus, every successful reception will bring a unit of new information. 
This scheme is shown to achieve capacity for the case of a multicast connection. 

This work is supported by the DARPA ITMANET grant and by the NSF grant CNS-0627021 (NeTS:NBD: XORs in the Air: Practical 
Wireless Network Coding) 
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However, both fountain codes and random linear network coding have the problem of decoding delay. Both 
schemes are block-based. While this works for a file download setting, many applications today need to broadcast 
a continuous stream of packets in real-time. The above schemes would segment the stream into blocks (also called 
generations) and process one block at a time. If playback can begin only after receiving a full block, then high 
throughput would require a large delay. 

This raises the interesting question - can playback begin even before the full block is received? In general, 
playback is possible till the point up to which all packets have been recovered, which we call the front of contiguous 
knowledge. Traditionally, no incentive is placed on decoding a part of the message using a part of the codeword. 
In a streaming application however, decoding older packets earlier reduces delay. Performance depends on not only 
how much data is transferred in a given time, but also which part of the data. In other words, we are more interested 
in packet delay than block delay. These issues have been studied by [5] and [6] in a point-to-point setting. However, 
in a network setting, the problem is not well understood. In related work, [7], [8] address the question of how many 
original packets are revealed before the whole block is decoded. However, playback delay depends on not just the 
number of recovered packets, but also the order in which they are recovered. 

Suppose we have full feedback, then reliable communication over a packet erasure channel can be achieved using 
Automatic Repeat reQuest (ARQ). This simple scheme achieves 100% throughput, in-order delivery and the lowest 
possible packet delay, and it is composable across links. In case of tandem networks, this is the optimal solution. 
However, link-by-link ARQ cannot achieve the multicast capacity of a general network. The well-known butterfly 
network is an example. Besides, ARQ does not work well with broadcast-mode links because retransmitting a 
packet that some receivers did not get is wasteful for the others that already have it. In contrast, network coding 
readily extends to broadcast-mode links and also achieves multicast capacity of any network. 

Our new scheme combines the benefits of network coding and ARQ by acknowledging degrees of freedom 
instead of original packets. (Here, degree of freedom refers to a new dimension in the appropriate vector space.) 
This new framework allows the code to incorporate receivers' states of knowledge and thereby enables the sender 
to control the evolution of the front of contiguous knowledge. The scheme may thus be viewed as a first step in 
feedback-based control of the tradeoff between throughput and decoding delay, along the lines suggested in [9]. 

This new kind of feedback is also useful in queue management. Consider a packet erasure broadcast channel. The 
network coding solution of [4] requires the sender to generate a linear combination using potentially all packets of 
a generation that have arrived so far. If feedback is used only to signal completion of a generation, then the sender 
will have to store the entire generation till it is decoded. If instead, receivers ACK each packet upon decoding, then 
the sender can drop packets that all receivers have decoded. 

However, even storing all undecoded packets may be suboptimal. Consider a situation where the sender has n 
packets and all receivers have received (n — 1) linear combinations: (P1+P2), (P2+P3), • ■ • , (Pn-i+Pn)- No packet 
can be decoded by any receiver, so the sender cannot drop any packet. However, the backlog in degrees of freedom 
is just 1. It would be enough if the sender stores any one of the pi's. The degrees of freedom backlog is also called 
the "virtual queue" ([10], [11]). We ideally want the physical queue to track the virtual queue. Now, even if the 
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An example of the drop-when-seen algorithm 



receivers get degrees of freedom at the specified rate, it is not clear when they would decode the original packets. 
Hence, the drop-when-decoded scheme will not achieve this goal. In this work, we show that we can achieve this 
goal if we allow ACKs on degrees of freedom. 

A. Our contribution and its implications 

We propose a new online coding and queue update algorithm that uses ACKs on degrees of freedom to guarantee 
that the physical queue size at the sender will track the backlog in degrees of freedom. We introduce the notion 
of a node "seeing" a message packet, which is defined as follows. (Note: in this work, we treat packets as vectors 
over a finite field.) 

Definition 1 (Seeing a packet): A node is said to have seen a packet p if it has enough information to compute 
a linear combination of the form (p + q), where q is itself a linear combination involving only packets that arrived 
after p at the sender. (Decoding implies seeing, as we can pick q = 0.) 

Our new scheme is called the drop-when-seen algorithm because a packet is dropped if all receivers have seen it. 
The intuition is that if all receivers have seen p, it is enough for the sender's transmissions to involve only packets 
beyond p. After decoding these packets, the receivers can compute q and hence obtain p as well. Therefore, even 
if the receivers have not decoded p, no information is lost by dropping it. 

Whereas ARQ ACKs a packet upon decoding, our scheme ACKs a packet when it is seen. This proves useful 
when there is a broadcast constraint. We present a deterministic coding scheme that guarantees that the coded 
packet, if received successfully, would simultaneously cause each receiver to see its next unseen packet. We 
will prove later that seeing a new packet translates to receiving a new degree of freedom. This means, the innovation 
guarantee property is satisfied and 100% throughput can be achieved. The example below explains this algorithm 
for a simple two-receiver case. Section HV-AI extends this scheme to more receivers. 

1 ) Example: Table Q] shows a sample of how the proposed idea works in a packet erasure broadcast channel with 
two receivers A and B. The sender's queue is shown after the arrival point and before the transmission point of 
a slot. In each slot, the sender picks the oldest unseen packet for A and B. If they are the same packet, then that 
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packet is sent. If not, their XOR is sent. This rule will cause both receivers to see their oldest unseen packet. In slot 
1, pi reaches A but not B. In slot 2, (pi ©P2) reaches A and B. Since A knows pi, it can also decode P2. As for 
B, it has now seen (but not decoded) pi. At this point, since A and B have seen pi, the sender drops it. This is fine 
because, B will eventually decode P2 (this happens in slot 4), at which time it can obtain pi. Similarly, as shown 
in the table, P2, P3 and P4 will be dropped in slots 3, 5 and 6 respectively. However, the drop-when-decoded 
policy will drop pi and P2 in slot 4, and P3 and P4 in slot 6. Thus, our new strategy clearly keeps the queue 
shorter. This is formally proved in Theorem Q] and Corollary [3] 
2) Implications of our new scheme: 

• The information deficit at a receiver is restricted to a window of packets that advances in a streaming manner 
and has a stable size (namely, the set of unseen packets). In this sense, the proposed encoding scheme is truly 
online. All receivers see packets in order. 

• The physical queue size is upper-bounded by the sum of the degrees of freedom backlog between the sender 
and receivers. Our scheme thus forms a natural bridge between the virtual and physical queue sizes. It can be 
used to extend results on the stability of virtual queues such as [10], [11], [12] to physical queues. 

• At most n packets are involved in the coded packet. This reduces the decoding complexity and the overhead 
for storing the coding coefficients. 

• We assume a single packet erasure broadcast channel. But, we believe our algorithm is composable and can 
be extended to a tandem network of broadcast links, and with suitable modifications, it can be applied to a 
more general setup like the one in [13]. 

• As for decoding delay, we can say that if a receiver receives a packet while it is a leader in terms of number 
of received degrees of freedom, then it will be able to decode all packets up to that point. Now, some seen 
packets might be decoded even before a receiver becomes a leader. The evolution of decoded packets needs 
further study. 

The scheme we proposed in [14] also showed that the physical queue tracks the virtual queues. However, unlike 
[14] our current work provides an explicit coding scheme that enables us to prove new results. In other work, [15] 
also combines feedback and coding to address decoding delay. However, their notion of delay ignores the order in 
which packets are decoded. Moreover, they do not consider a stream of stochastic arrivals. 

II. The setup 

A sender wants to broadcast a stream of packets to n receivers over a packet erasure broadcast channel. Time is 
slotted. We focus on linear codes - every transmission is a linear combination of packets from the incoming stream. 
A node can compute any linear combination whose coefficient vector is in the span of the coefficient vectors of 
previously received coded packets. This leads to the following definition. 

Definition 2 (Knowledge of a node): The knowledge of a node is the set of all linear combinations of original 
packets that it can compute, based on the information it has received so far. The coefficient vectors of these linear 
combinations form a vector space called the knowledge space of the node. 
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Fig. 1. Relative timing of arrival, service and departure points within a slot 

The sender has one physical queue with no preset size constraints. We use the notion of a virtual queue to 
represent the backlog in degrees of freedom between the sender and receiver. There is one virtual queue for each 
receiver. 

Definition 3 (Virtual queue): The size of the j th virtual queue is defined to be the difference between the 
dimension of the knowledge space of the sender and that of the j th receiver. 

Arrivals: Packets arrive into the sender's physical queue according to a Bernoulli process of rate A. An arrival 
at the physical queue translates to an arrival at each virtual queue. 

Service: The channel accepts one packet per slot. Each receiver receives the packet with no errors with probability 
/i or an erasure occurs with probability (1 — fi). Erasures occur independently across receivers and across slots. 
Receivers can detect erasures. We assume the innovation guarantee property holds. Then, we can map successful 
reception to service of the virtual queue. Thus, in each slot, every virtual queue is served independently with 
probability /i. Service of the physical queue will depend on the queue update scheme used. 

Feedback: In Algorithm 1, feedback is sent when a window of packets is decoded, in order to indicate successful 
decoding. For Algorithm 2, the feedback is needed in every slot to indicate an erasure. We assume perfect delay-free 
feedback. 

Timing: Figure Q] shows the relative timing of various events within a slot. For simplicity, we assume that 
the transmission, unless erased by the channel, reaches the receivers before they send feedback for that slot, and 
feedback from all receivers reaches the sender before the end of the same slot. Thus, the feedback incorporates the 
current slot's reception also. 

Let p := X/p. In what follows, we will compare the expected queue size for the baseline drop-when-decoded scheme 
and the new drop-when-seen scheme, asymptotically as p — > 1~. 



The coding scheme assumed is an online version of [4], with no preset generation size. The sender transmits a 
random linear combination of all packets currently in the queue. For any receiver, packets at the sender are unknowns, 
and each received packet is an equation in these unknowns. Decoding can happen whenever the difference between 
the number of equations and unknowns involved becomes zero. This difference is essentially the backlog in degrees 
of freedom, i.e., the virtual queue size. Thus, successful decoding at a receiver happens when the corresponding 



III. Algorithm 1 - Drop when decoded 
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Fig. 2. Markov chain for the size of a virtual queue. A := 1 — A; fj, := 1 — fi 

virtual queue becomes emptj^. Whenever a receiver is able to decode in this manner, it informs the sender. Based on 
this, the sender tracks which receivers have decoded each packet, and drops a packet if it has been decoded by all 
receivers. We assume the field size is large enough to ignore the probability that the coded packet is not innovative. 
We will now study the behavior of the virtual queues in steady state. But first, we introduce some notation: 

Q(t) := Size of the physical queue at the end of slot t 

Qj{t) := Size of the j th virtual queue at the end of slot t 
Figure [2] shows the Markov chain for Qj(t). If p < 1, the chain is positive recurrent. Its steady state distribution 
is given by [16]: 7Tfc = (1 — a)a k , k > 0, where a — X< }]Zx\ ■ Thus, the steady state expected size of any virtual 



queue is: 

oo 

Urn E[Q, (t)] = W = (1 - M) ' (D 

3=0 { P > 

Next, we analyze the physical queue size under this scheme. Let T be the time an arbitrary arrival in steady state 
spends in the physical queue before departure, excluding the slot in which the arrival occurs. The packet will not 
depart until each virtual queue has become empty at least once since its arrival. Let Dj be the time until the next 
emptying of the j th virtual queue after the new arrival. Then, T = max., Dj and so, E[T] > E[Z?j-]. Hence, we 
focus on E[Z3j]. 

We condition on the new arrival seeing state Qj = k before joining the queue. Then, the state at the end of the 
slot in which the packet arrives, is k if the channel is ON in that slot and (k + 1) otherwise. Now, Dj is simply the 
first passage time from the state at the end of that slot to state 0. It can be shown that the expected first passage 
time from state u to state for u > is given by T u ,q = — z^. Now, due to the property that Bernoulli arrivals 
see time averages (BASTA) [17], the arrival sees the same distribution for Qj as the steady state distribution given 
above. We can then find E[Dj] thus: 

oo 

E[Dj] = ^>(New arrival sees state k)E[Dj ;| State k] 

k=0 



= Y] [/-iTfc.o + (1 - Li)T k+h o] = • 7j v 



k=0 



If the chain is positive recurrent (p < 1), we can use Little's law to find steady state expected physical queue size: 
E[Q] = AE[T] > XE[Dj]. This leads to the following result: 

'it may be possible to find some unknowns even before the virtual queue becomes empty. However, this is a higher order effect and we 
ignore it. 
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Theorem 1: The expected size of the physical queue in steady state for Algorithm 1 is f2 




IV. Algorithm 2 - Drop when seen 

Algorithm 2 was briefly introduced in Section II- Al The algorithm uses the notion of reduced row echelon form 
(RREF) of a matrix in representing the knowledge of a receiver. The definition and properties of the RREF can be 
found in [18]. 

Let V be the knowledge space of some receiver. Suppose m packets have arrived at the sender so far. Then, V 
must be a subspace of F™ and can be represented using a dim(V) x m matrix over F g whose rows form a basis of 
V. Multiple representations are possible depending on the basis chosen. However, if we insist that the matrix must 
be in RREF, we get a unique representation. This unique RREF basis can be obtained by performing a Gaussian 
elimination on any other basis matrix. In the RREF basis, the first nonzero entry of any row is called a pivot. Any 
column with a pivot is called a pivot column. The number of pivot columns equals the number of nonzero rows, 
which is dim[V}. The k th packet to have arrived at the sender is said to have an index k and is denoted p^. The 
columns are ordered so that column k maps to packet pk. The next theorem connects the seen packets and the 
RREF basis. 

Theorem 2: A node has seen a packet with index k if and only if the k th column of the RREF basis B of the 
knowledge space V of the node is a pivot column. 

Proof: The 'if part is clear. For the 'only if, suppose column k of B does not contain a pivot. In any linear 
combination of the rows, rows with pivot after column k cannot contribute anything to column k. Rows with pivot 
before column k will result in a nonzero term in some column to the left of k. So the first nonzero term of any 
vector in V cannot be in column k, i.e., pk could not have been seen. ■ 

Corollary 1: If receiver j has seen packet pk, then it knows exactly one linear combination of the form pk + q 
such that q involves only unseen packets with index more than k. 

Corollary 2: The number of packets seen by a receiver is equal to the dimension of its knowledge space. 

Definition 4 (Witness): We denote the unique linear combination guaranteed by Corollary Q] as Wj(pk), the 
witness for receiver j seeing pk- 

The central idea of the new algorithm is to keep track of seen packets instead of decoded packets. After each 
slot, every receiver informs the sender whether an erasure occurred, using perfect feedback. The aim is to use this 
feedback to ensure the sender stores just enough data to be able to satisfy the innovation guarantee property. The 
two main parts of the algorithm are the coding and queue update modules. 

The coding module computes a linear combination g, which will cause any receiver that receives it, to see its 
oldest unseen packet. First, the sender computes each receiver's knowledge space using feedback and finds its 
oldest unseen packet. Only these packets will be involved in g, and hence we call them the transmit set. A receiver 
can cancel packets involved in g that it has already seen, by subtracting suitable multiples of the corresponding 
witnesses. Therefore, the coefficients for g should be picked such that for each receiver, after canceling the seen 
packets, the remaining coefficient of the oldest unseen packet is non-zero. Theorem [3] proves that this is possible if 
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the field size is at least n. With two receivers, the coding module is a simple XOR based scheme (see Table Q}. The 
coding module readily implies the following queue update rule. Drop a packet if all receivers have seen it, since 
the coding module will not use it ever again. Also, while computing the knowledge spaces of the receivers, the 
sender only needs to track the projection of these spaces on dimensions currently in the queue. Thus, the algorithm 
can be implemented in an incremental manner and the complexity tracks the queue size. 

A. The formal description of the algorithm 

The drop-when-seen algorithm: The algorithm works with the RREF bases of the receivers' knowledge spaces. 
The representation is in the form of coefficient vectors in terms of the current queue contents and not the original 
packet stream. 

1) Initialize matrices B\,.,., B n to the empty matrix. 

2) Incorporate new arrivals: Suppose there are a new arrivals. Add the new packets to the end of the queue. 
Append a zeros to every row in each Bj. 

3) Transmission: If the queue is empty, do nothing; else compute g using the coding module and transmit it. 

4) Incorporate channel state feedback: 
For every receiver j = 1 to n, do: 

If receiver j received the transmission, include the coefficient vector of g in terms of the current queue 
contents, as a new row in Bj. Perform Gaussian elimination. 

5) Separate out packets that all receivers have seen: 
Update the following sets and bases: 

Sj := Set of indices of pivot columns of Bj 
Sa '■= {~Yj = \Sj (set of packets seen by all receivers). 
New Bj := Sub-matrix of current Bj obtained by excluding columns in 5a and corresponding pivot rows. 

6) Update the queue: Drop the packets with indices in Sa- 

7) Go back to step 2 for the next slot. 
The coding module: 

Let {ui, v,2, ■ ■ ■ , u m } be the set of indices of the oldest unseen packets of the receivers, sorted in ascending order 
(rn<n, since the oldest unseen packet may be the same for some receivers). Exclude receivers whose oldest unseen 
packets have not yet arrived at the sender. Let R{ui) be the set of receivers whose oldest unseen packet is p Ui . We 
now present the coding module to select the linear combination for transmission. 

1) Loop over oldest unseen packets 
For j = 1 to m, do: 

All receivers in R(uj) have seen packets p Ui for i < j. Now, Vr £ R(v,j), find y r := YliZi Q iW r (p Ui ), 
where W r (p u .) is the witness for receiver r seeing p u .. Pick a>j £ ¥ q such that otj is different from the 
coefficient of p Uj in y r for each r £ R(uj). 

2) Compute the transmit packet: g := X)"=i a iPut 
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Theorem 3: If the field size is at least n, then the coding module picks a linear combination that will cause any 
receiver to see its oldest unseen packet upon successful reception. 

Proof: First we show that a suitable choice always exists for aj. For r 6 i?(ui), y r = 0. So pick ct\ = 1. 
For j > 1, \R(uj)\<(n — 1). Even if each y r for r G R{uf) has a different coefficient for p Uj , that covers only 
(n — 1) different field elements. If q > n, then there is a choice left in F g for aj. Vj, W S R(uj), receiver r knows 
y r . Now, g and y r have the same coefficient for all packets with index less than Uj, and a different coefficient for 
p Uj . Hence, g — y r will involve p Uj and only packets with index beyond Uj. So r can see p Uj . ■ 

Theorem[2]implies that seeing an unseen packet corresponds to receiving an unknown degree of freedom. Hence, 
Theorem [3] essentially says that the innovation guarantee property is satisfied and hence the scheme is throughput 
optimal. 

B. Connecting the physical and virtual queue sizes 

We will need the following notation: 
S(t) := Set of packets arrived at sender till the end of slot t 

V{t) := Sender's knowledge space after incorporating the arrivals in slot t. This is simply equal to Fg* " 
Vj(t) := Receiver j's knowledge space at the end of slot t 
Sj(t) := Set of packets receiver j has seen till end of slot t 
Lemma 1: For S\, S2, ■ ■ ■ , Sfc (k > 1), subsets of a set S: 

k 

|S|-|nJu#|<X;(|S|-|Si|) (2) 

2=1 

We omit the proof. We apply this lemma on the sets S(t) and Sj(t), j=l to n. Since the queue holds packets not 
seen by all receivers, Q(t) = \S{t)\ - \ C\™ =1 Sj(t)\. Also, from Corollary EJ \Sj(t)\ = dim[Vj(t)]. Hence the RHS 
of (O becomes YHj=i \dim\y{t)] — GKm[Vy(t)]] , which is the sum of virtual queue sizes. This implies the next 
theorem. 

Theorem 4: For Algorithm 2, the physical queue size at the sender is upper-bounded by the sum of the virtual 
queue sizes, i.e., the sum of the degrees-of-freedom backlog between the sender and the receivers. 
Theorem |4] and the result in (UJ lead to this corollary. 



Corollary 3: The expected size of the physical queue in steady state for Algorithm 2 is O 




V. Conclusions and Extensions 

Comparing the results in Theorem Q] and Corollary [3] we see that the queue size for the new Algorithm 2 is 
significantly lower than Algorithm 1 . This will prove useful in reducing congestion. The new algorithm allows the 
physical queue size to track the virtual queue size. This extends stability and other queuing-theoretic results on 
virtual queues to physical queues. 

We believe the proposed scheme will be robust to delayed or imperfect feedback, just like conventional ARQ. 
The scheme readily extends to a tandem network of broadcast links (with no mergers) if the intermediate nodes 
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use the evidence packets in place of the original packets. We expect that it will also extend to other topologies with 
suitable modifications. 

We have proposed a natural extension of ARQ for coded networks. This is the first step towards the goal of using 
feedback on degrees of freedom to control the tradeoff between throughput and decoding delay, by dynamically 
adjusting the extent to which packets are mixed in the network. 
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